Ra-L/Icra 2020 - Guided Constrained Policy Optimization For Dynamic Quadrupedal Robot Locomotion